Overview
Brought to you by YData
Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 844.338 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 345.8 MiB |
| Average record size in memory | 429.4 B |
Variable types
| Numeric | 13 |
|---|---|
| DateTime | 3 |
| Categorical | 8 |
| Text | 1 |
assortment is highly overall correlated with store_type | High correlation |
competition_open_since_year is highly overall correlated with competition_time_month | High correlation |
competition_time_month is highly overall correlated with competition_open_since_year | High correlation |
month is highly overall correlated with week_of_year | High correlation |
promo2 is highly overall correlated with promo2_since_year and 1 other fields | High correlation |
promo2_since_year is highly overall correlated with promo2 and 2 other fields | High correlation |
promo_time_week is highly overall correlated with promo2 and 1 other fields | High correlation |
store_type is highly overall correlated with assortment | High correlation |
week_of_year is highly overall correlated with month | High correlation |
year is highly overall correlated with promo2_since_year | High correlation |
state_holiday is highly imbalanced (99.3%) | Imbalance |
week_of_year has 10126 (1.2%) zeros | Zeros |
competition_time_month has 268025 (31.7%) zeros | Zeros |
promo_time_week has 421646 (49.9%) zeros | Zeros |
Reproduction
| Analysis started | 2025-08-14 14:47:41.302011 |
|---|---|
| Analysis finished | 2025-08-14 14:49:07.829220 |
| Duration | 1 minute and 26.53 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
store
Real number (ℝ)
| Distinct | 1115 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 558.42137 |
| Minimum | 1 |
|---|---|
| Maximum | 1115 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 56 |
| Q1 | 280 |
| median | 558 |
| Q3 | 837 |
| 95-th percentile | 1060 |
| Maximum | 1115 |
| Range | 1114 |
| Interquartile range (IQR) | 557 |
Descriptive statistics
| Standard deviation | 321.73086 |
|---|---|
| Coefficient of variation (CV) | 0.57614353 |
| Kurtosis | -1.1988364 |
| Mean | 558.42137 |
| Median Absolute Deviation (MAD) | 278 |
| Skewness | 0.00042588538 |
| Sum | 4.7149639 × 108 |
| Variance | 103510.75 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 335 | 942 | 0.1% |
| 85 | 942 | 0.1% |
| 262 | 942 | 0.1% |
| 682 | 942 | 0.1% |
| 769 | 942 | 0.1% |
| 733 | 942 | 0.1% |
| 494 | 942 | 0.1% |
| 1097 | 942 | 0.1% |
| 562 | 942 | 0.1% |
| 423 | 942 | 0.1% |
| Other values (1105) | 834918 |
| Value | Count | Frequency (%) |
| 1 | 781 | |
| 2 | 784 | |
| 3 | 779 | |
| 4 | 784 | |
| 5 | 779 | |
| 6 | 780 | |
| 7 | 786 | |
| 8 | 784 | |
| 9 | 779 | |
| 10 | 784 |
| Value | Count | Frequency (%) |
| 1115 | 781 | |
| 1114 | 784 | |
| 1113 | 784 | |
| 1112 | 779 | |
| 1111 | 779 | |
| 1110 | 783 | |
| 1109 | 622 | |
| 1108 | 780 | |
| 1107 | 623 | |
| 1106 | 784 |
day_of_week
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5203497 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.7237124 |
|---|---|
| Coefficient of variation (CV) | 0.48964237 |
| Kurtosis | -1.2593474 |
| Mean | 3.5203497 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.019309987 |
| Sum | 2972365 |
| Variance | 2.9711843 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 144052 | |
| 2 | 143955 | |
| 3 | 141922 | |
| 5 | 138633 | |
| 1 | 137557 | |
| 4 | 134626 | |
| 7 | 3593 | 0.4% |
| Value | Count | Frequency (%) |
| 1 | 137557 | |
| 2 | 143955 | |
| 3 | 141922 | |
| 4 | 134626 | |
| 5 | 138633 | |
| 6 | 144052 | |
| 7 | 3593 | 0.4% |
| Value | Count | Frequency (%) |
| 7 | 3593 | 0.4% |
| 6 | 144052 | |
| 5 | 138633 | |
| 4 | 134626 | |
| 3 | 141922 | |
| 2 | 143955 | |
| 1 | 137557 |
date
Date
| Distinct | 942 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.9 MiB |
| Minimum | 2013-01-01 00:00:00 |
|---|---|
| Maximum | 2015-07-31 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
sales
Real number (ℝ)
| Distinct | 21733 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6955.9591 |
| Minimum | 46 |
|---|---|
| Maximum | 41551 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.9 MiB |
Quantile statistics
| Minimum | 46 |
|---|---|
| 5-th percentile | 3174 |
| Q1 | 4859 |
| median | 6369 |
| Q3 | 8360 |
| 95-th percentile | 12668 |
| Maximum | 41551 |
| Range | 41505 |
| Interquartile range (IQR) | 3501 |
Descriptive statistics
| Standard deviation | 3103.8155 |
|---|---|
| Coefficient of variation (CV) | 0.44620957 |
| Kurtosis | 4.8540266 |
| Mean | 6955.9591 |
| Median Absolute Deviation (MAD) | 1694 |
| Skewness | 1.5949288 |
| Sum | 5.8731806 × 109 |
| Variance | 9633670.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5674 | 215 | < 0.1% |
| 5558 | 197 | < 0.1% |
| 5483 | 196 | < 0.1% |
| 6049 | 195 | < 0.1% |
| 6214 | 195 | < 0.1% |
| 5723 | 194 | < 0.1% |
| 5449 | 192 | < 0.1% |
| 5489 | 191 | < 0.1% |
| 5140 | 191 | < 0.1% |
| 5041 | 190 | < 0.1% |
| Other values (21723) | 842382 |
| Value | Count | Frequency (%) |
| 46 | 1 | |
| 124 | 1 | |
| 133 | 1 | |
| 286 | 1 | |
| 297 | 1 | |
| 316 | 1 | |
| 416 | 1 | |
| 506 | 1 | |
| 520 | 1 | |
| 530 | 1 |
| Value | Count | Frequency (%) |
| 41551 | 1 | |
| 38722 | 1 | |
| 38484 | 1 | |
| 38367 | 1 | |
| 38037 | 1 | |
| 38025 | 1 | |
| 37646 | 1 | |
| 37403 | 1 | |
| 37376 | 1 | |
| 37122 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 467463 | |
| 1 | 376875 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 467463 | |
| 1 | 376875 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 467463 | |
| 1 | 376875 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 467463 | |
| 1 | 376875 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 467463 | |
| 1 | 376875 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 467463 | |
| 1 | 376875 |
state_holiday
Categorical
Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.2 MiB |
| regular_day | |
|---|---|
| public_holiday | 694 |
| easter_holiday | 145 |
| christmas | 71 |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 11.002813 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | regular_day |
|---|---|
| 2nd row | regular_day |
| 3rd row | regular_day |
| 4th row | regular_day |
| 5th row | regular_day |
Common Values
| Value | Count | Frequency (%) |
| regular_day | 843428 | |
| public_holiday | 694 | 0.1% |
| easter_holiday | 145 | < 0.1% |
| christmas | 71 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| regular_day | 843428 | |
| public_holiday | 694 | 0.1% |
| easter_holiday | 145 | < 0.1% |
| christmas | 71 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1687911 | |
| r | 1687072 | |
| l | 844961 | |
| y | 844267 | |
| d | 844267 | |
| _ | 844267 | |
| u | 844122 | |
| e | 843718 | |
| g | 843428 | |
| i | 1604 | < 0.1% |
| Other values (8) | 4476 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9290093 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1687911 | |
| r | 1687072 | |
| l | 844961 | |
| y | 844267 | |
| d | 844267 | |
| _ | 844267 | |
| u | 844122 | |
| e | 843718 | |
| g | 843428 | |
| i | 1604 | < 0.1% |
| Other values (8) | 4476 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9290093 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1687911 | |
| r | 1687072 | |
| l | 844961 | |
| y | 844267 | |
| d | 844267 | |
| _ | 844267 | |
| u | 844122 | |
| e | 843718 | |
| g | 843428 | |
| i | 1604 | < 0.1% |
| Other values (8) | 4476 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9290093 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1687911 | |
| r | 1687072 | |
| l | 844961 | |
| y | 844267 | |
| d | 844267 | |
| _ | 844267 | |
| u | 844122 | |
| e | 843718 | |
| g | 843428 | |
| i | 1604 | < 0.1% |
| Other values (8) | 4476 | < 0.1% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 680893 | |
| 1 | 163445 | 19.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 680893 | |
| 1 | 163445 | 19.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 680893 | |
| 1 | 163445 | 19.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 680893 | |
| 1 | 163445 | 19.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 680893 | |
| 1 | 163445 | 19.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 680893 | |
| 1 | 163445 | 19.4% |
store_type
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.1 MiB |
| a | |
|---|---|
| d | |
| c | |
| b | 15560 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | c |
|---|---|
| 2nd row | a |
| 3rd row | a |
| 4th row | c |
| 5th row | a |
Common Values
| Value | Count | Frequency (%) |
| a | 457042 | |
| d | 258768 | |
| c | 112968 | 13.4% |
| b | 15560 | 1.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 457042 | |
| d | 258768 | |
| c | 112968 | 13.4% |
| b | 15560 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 457042 | |
| d | 258768 | |
| c | 112968 | 13.4% |
| b | 15560 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 457042 | |
| d | 258768 | |
| c | 112968 | 13.4% |
| b | 15560 | 1.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 457042 | |
| d | 258768 | |
| c | 112968 | 13.4% |
| b | 15560 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 457042 | |
| d | 258768 | |
| c | 112968 | 13.4% |
| b | 15560 | 1.8% |
assortment
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.5 MiB |
| basic | |
|---|---|
| extended | |
| extra | 8209 |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 6.3901565 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | basic |
|---|---|
| 2nd row | basic |
| 3rd row | basic |
| 4th row | extended |
| 5th row | basic |
Common Values
| Value | Count | Frequency (%) |
| basic | 444875 | |
| extended | 391254 | |
| extra | 8209 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| basic | 444875 | |
| extended | 391254 | |
| extra | 8209 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1181971 | |
| d | 782508 | |
| a | 453084 | 8.4% |
| s | 444875 | 8.2% |
| b | 444875 | 8.2% |
| c | 444875 | 8.2% |
| i | 444875 | 8.2% |
| x | 399463 | 7.4% |
| t | 399463 | 7.4% |
| n | 391254 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5395452 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1181971 | |
| d | 782508 | |
| a | 453084 | 8.4% |
| s | 444875 | 8.2% |
| b | 444875 | 8.2% |
| c | 444875 | 8.2% |
| i | 444875 | 8.2% |
| x | 399463 | 7.4% |
| t | 399463 | 7.4% |
| n | 391254 | 7.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5395452 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1181971 | |
| d | 782508 | |
| a | 453084 | 8.4% |
| s | 444875 | 8.2% |
| b | 444875 | 8.2% |
| c | 444875 | 8.2% |
| i | 444875 | 8.2% |
| x | 399463 | 7.4% |
| t | 399463 | 7.4% |
| n | 391254 | 7.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5395452 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1181971 | |
| d | 782508 | |
| a | 453084 | 8.4% |
| s | 444875 | 8.2% |
| b | 444875 | 8.2% |
| c | 444875 | 8.2% |
| i | 444875 | 8.2% |
| x | 399463 | 7.4% |
| t | 399463 | 7.4% |
| n | 391254 | 7.3% |
competition_distance
Real number (ℝ)
| Distinct | 655 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5961.8275 |
| Minimum | 20 |
|---|---|
| Maximum | 200000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.9 MiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 130 |
| Q1 | 710 |
| median | 2330 |
| Q3 | 6910 |
| 95-th percentile | 20930 |
| Maximum | 200000 |
| Range | 199980 |
| Interquartile range (IQR) | 6200 |
Descriptive statistics
| Standard deviation | 12592.181 |
|---|---|
| Coefficient of variation (CV) | 2.1121344 |
| Kurtosis | 145.28866 |
| Mean | 5961.8275 |
| Median Absolute Deviation (MAD) | 1980 |
| Skewness | 10.134908 |
| Sum | 5.0337975 × 109 |
| Variance | 1.5856303 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 250 | 9210 | 1.1% |
| 50 | 6249 | 0.7% |
| 350 | 6239 | 0.7% |
| 1200 | 6069 | 0.7% |
| 190 | 6066 | 0.7% |
| 90 | 5607 | 0.7% |
| 180 | 5421 | 0.6% |
| 330 | 5294 | 0.6% |
| 150 | 5292 | 0.6% |
| 140 | 4684 | 0.6% |
| Other values (645) | 784207 |
| Value | Count | Frequency (%) |
| 20 | 779 | 0.1% |
| 30 | 3115 | |
| 40 | 3888 | |
| 50 | 6249 | |
| 60 | 2342 | 0.3% |
| 70 | 3734 | |
| 80 | 2331 | 0.3% |
| 90 | 5607 | |
| 100 | 3900 | |
| 110 | 4514 |
| Value | Count | Frequency (%) |
| 200000 | 2186 | |
| 75860 | 887 | |
| 58260 | 885 | |
| 48330 | 784 | 0.1% |
| 46590 | 784 | 0.1% |
| 45740 | 780 | 0.1% |
| 44320 | 780 | 0.1% |
| 40860 | 881 | |
| 40540 | 780 | 0.1% |
| 38710 | 784 | 0.1% |
competition_open_since_month
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.7873553 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.3099167 |
|---|---|
| Coefficient of variation (CV) | 0.48765926 |
| Kurtosis | -1.2318753 |
| Mean | 6.7873553 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.048451057 |
| Sum | 5730822 |
| Variance | 10.955548 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 112179 | |
| 4 | 98204 | |
| 11 | 86359 | |
| 3 | 80052 | |
| 7 | 76226 | |
| 12 | 63968 | |
| 6 | 63913 | |
| 10 | 63216 | |
| 5 | 58271 | |
| 2 | 56895 | |
| Other values (2) | 85055 |
| Value | Count | Frequency (%) |
| 1 | 37733 | 4.5% |
| 2 | 56895 | |
| 3 | 80052 | |
| 4 | 98204 | |
| 5 | 58271 | |
| 6 | 63913 | |
| 7 | 76226 | |
| 8 | 47322 | |
| 9 | 112179 | |
| 10 | 63216 |
| Value | Count | Frequency (%) |
| 12 | 63968 | |
| 11 | 86359 | |
| 10 | 63216 | |
| 9 | 112179 | |
| 8 | 47322 | |
| 7 | 76226 | |
| 6 | 63913 | |
| 5 | 58271 | |
| 4 | 98204 | |
| 3 | 80052 |
competition_open_since_year
Real number (ℝ)
High correlation 
| Distinct | 23 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2010.3311 |
| Minimum | 1900 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.9 MiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 2002 |
| Q1 | 2008 |
| median | 2012 |
| Q3 | 2014 |
| 95-th percentile | 2015 |
| Maximum | 2015 |
| Range | 115 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 5.5026278 |
|---|---|
| Coefficient of variation (CV) | 0.0027371749 |
| Kurtosis | 123.90308 |
| Mean | 2010.3311 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -7.2173228 |
| Sum | 1.6973989 × 109 |
| Variance | 30.278913 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2013 | 170465 | |
| 2014 | 151774 | |
| 2015 | 91118 | |
| 2012 | 61716 | 7.3% |
| 2005 | 46703 | 5.5% |
| 2010 | 42715 | 5.1% |
| 2011 | 41363 | 4.9% |
| 2009 | 40711 | 4.8% |
| 2008 | 40195 | 4.8% |
| 2007 | 36125 | 4.3% |
| Other values (13) | 121453 |
| Value | Count | Frequency (%) |
| 1900 | 622 | 0.1% |
| 1961 | 779 | 0.1% |
| 1990 | 3885 | 0.5% |
| 1994 | 1552 | 0.2% |
| 1995 | 1404 | 0.2% |
| 1998 | 766 | 0.1% |
| 1999 | 6213 | 0.7% |
| 2000 | 7631 | 0.9% |
| 2001 | 12157 | |
| 2002 | 20736 |
| Value | Count | Frequency (%) |
| 2015 | 91118 | |
| 2014 | 151774 | |
| 2013 | 170465 | |
| 2012 | 61716 | 7.3% |
| 2011 | 41363 | 4.9% |
| 2010 | 42715 | 5.1% |
| 2009 | 40711 | 4.8% |
| 2008 | 40195 | 4.8% |
| 2007 | 36125 | 4.3% |
| 2006 | 35543 | 4.2% |
promo2
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.1 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 423292 | |
| 1 | 421046 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 423292 | |
| 1 | 421046 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 423292 | |
| 1 | 421046 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 423292 | |
| 1 | 421046 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 423292 | |
| 1 | 421046 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 423292 | |
| 1 | 421046 |
promo2_since_week
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.629083 |
| Minimum | 1 |
|---|---|
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 12 |
| median | 22 |
| Q3 | 37 |
| 95-th percentile | 47 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 14.288315 |
|---|---|
| Coefficient of variation (CV) | 0.60469188 |
| Kurtosis | -1.1948145 |
| Mean | 23.629083 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.17039865 |
| Sum | 19950933 |
| Variance | 204.15594 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14 | 69320 | 8.2% |
| 40 | 56919 | 6.7% |
| 31 | 42369 | 5.0% |
| 10 | 42002 | 5.0% |
| 5 | 39506 | 4.7% |
| 1 | 34479 | 4.1% |
| 13 | 33878 | 4.0% |
| 37 | 33528 | 4.0% |
| 22 | 32208 | 3.8% |
| 18 | 30709 | 3.6% |
| Other values (42) | 429420 |
| Value | Count | Frequency (%) |
| 1 | 34479 | |
| 2 | 9644 | 1.1% |
| 3 | 9784 | 1.2% |
| 4 | 9778 | 1.2% |
| 5 | 39506 | |
| 6 | 10555 | 1.3% |
| 7 | 9776 | 1.2% |
| 8 | 9793 | 1.2% |
| 9 | 20107 | |
| 10 | 42002 |
| Value | Count | Frequency (%) |
| 52 | 4342 | 0.5% |
| 51 | 6424 | 0.8% |
| 50 | 7188 | 0.9% |
| 49 | 7030 | 0.8% |
| 48 | 13442 | |
| 47 | 6307 | 0.7% |
| 46 | 6408 | 0.8% |
| 45 | 30480 | |
| 44 | 8061 | 1.0% |
| 43 | 6428 | 0.8% |
promo2_since_year
Real number (ℝ)
High correlation 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2012.7979 |
| Minimum | 2009 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.9 MiB |
Quantile statistics
| Minimum | 2009 |
|---|---|
| 5-th percentile | 2009 |
| Q1 | 2012 |
| median | 2013 |
| Q3 | 2014 |
| 95-th percentile | 2015 |
| Maximum | 2015 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.6601247 |
|---|---|
| Coefficient of variation (CV) | 0.0008247846 |
| Kurtosis | -0.19791125 |
| Mean | 2012.7979 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.78829569 |
| Sum | 1.6994818 × 109 |
| Variance | 2.7560141 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2013 | 257197 | |
| 2014 | 227630 | |
| 2015 | 103528 | |
| 2011 | 95035 | 11.3% |
| 2012 | 60712 | 7.2% |
| 2009 | 53824 | 6.4% |
| 2010 | 46412 | 5.5% |
| Value | Count | Frequency (%) |
| 2009 | 53824 | 6.4% |
| 2010 | 46412 | 5.5% |
| 2011 | 95035 | 11.3% |
| 2012 | 60712 | 7.2% |
| 2013 | 257197 | |
| 2014 | 227630 | |
| 2015 | 103528 |
| Value | Count | Frequency (%) |
| 2015 | 103528 | |
| 2014 | 227630 | |
| 2013 | 257197 | |
| 2012 | 60712 | 7.2% |
| 2011 | 95035 | 11.3% |
| 2010 | 46412 | 5.5% |
| 2009 | 53824 | 6.4% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 713533 | |
| 1 | 130805 | 15.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 713533 | |
| 1 | 130805 | 15.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 713533 | |
| 1 | 130805 | 15.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 713533 | |
| 1 | 130805 | 15.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 713533 | |
| 1 | 130805 | 15.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 844338 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 713533 | |
| 1 | 130805 | 15.5% |
year
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.6 MiB |
| 2013 | |
|---|---|
| 2014 | |
| 2015 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2015 |
|---|---|
| 2nd row | 2015 |
| 3rd row | 2015 |
| 4th row | 2015 |
| 5th row | 2015 |
Common Values
| Value | Count | Frequency (%) |
| 2013 | 337924 | |
| 2014 | 310385 | |
| 2015 | 196029 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2013 | 337924 | |
| 2014 | 310385 | |
| 2015 | 196029 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 844338 | |
| 0 | 844338 | |
| 1 | 844338 | |
| 3 | 337924 | |
| 4 | 310385 | 9.2% |
| 5 | 196029 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3377352 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 844338 | |
| 0 | 844338 | |
| 1 | 844338 | |
| 3 | 337924 | |
| 4 | 310385 | 9.2% |
| 5 | 196029 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3377352 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 844338 | |
| 0 | 844338 | |
| 1 | 844338 | |
| 3 | 337924 | |
| 4 | 310385 | 9.2% |
| 5 | 196029 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3377352 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 844338 | |
| 0 | 844338 | |
| 1 | 844338 | |
| 3 | 337924 | |
| 4 | 310385 | 9.2% |
| 5 | 196029 | 5.8% |
month
Real number (ℝ)
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.8457738 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 8 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.3239595 |
|---|---|
| Coefficient of variation (CV) | 0.56860898 |
| Kurtosis | -1.03319 |
| Mean | 5.8457738 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.25770643 |
| Sum | 4935809 |
| Variance | 11.048707 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 86335 | |
| 3 | 85975 | |
| 7 | 85576 | |
| 6 | 82571 | |
| 4 | 81726 | |
| 2 | 80239 | |
| 5 | 80099 | |
| 8 | 54411 | |
| 10 | 53291 | |
| 9 | 52321 | |
| Other values (2) | 101794 |
| Value | Count | Frequency (%) |
| 1 | 86335 | |
| 2 | 80239 | |
| 3 | 85975 | |
| 4 | 81726 | |
| 5 | 80099 | |
| 6 | 82571 | |
| 7 | 85576 | |
| 8 | 54411 | |
| 9 | 52321 | |
| 10 | 53291 |
| Value | Count | Frequency (%) |
| 12 | 50393 | |
| 11 | 51401 | |
| 10 | 53291 | |
| 9 | 52321 | |
| 8 | 54411 | |
| 7 | 85576 | |
| 6 | 82571 | |
| 5 | 80099 | |
| 4 | 81726 | |
| 3 | 85975 |
day
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.835706 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.6833918 |
|---|---|
| Coefficient of variation (CV) | 0.54834259 |
| Kurtosis | -1.1796706 |
| Mean | 15.835706 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.011112159 |
| Sum | 13370688 |
| Variance | 75.401292 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11 | 30119 | 3.6% |
| 4 | 29471 | 3.5% |
| 27 | 29270 | 3.5% |
| 13 | 29261 | 3.5% |
| 23 | 29239 | 3.5% |
| 2 | 29233 | 3.5% |
| 16 | 29202 | 3.5% |
| 18 | 29058 | 3.4% |
| 28 | 28365 | 3.4% |
| 7 | 28357 | 3.4% |
| Other values (21) | 552763 |
| Value | Count | Frequency (%) |
| 1 | 19366 | |
| 2 | 29233 | |
| 3 | 25056 | |
| 4 | 29471 | |
| 5 | 28172 | |
| 6 | 27566 | |
| 7 | 28357 | |
| 8 | 27959 | |
| 9 | 27067 | |
| 10 | 28156 |
| Value | Count | Frequency (%) |
| 31 | 15923 | |
| 30 | 26324 | |
| 29 | 23571 | |
| 28 | 28365 | |
| 27 | 29270 | |
| 26 | 26167 | |
| 25 | 27063 | |
| 24 | 28157 | |
| 23 | 29239 | |
| 22 | 27987 |
week_of_year
Real number (ℝ)
High correlation  Zeros 
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.957035 |
| Minimum | 0 |
|---|---|
| Maximum | 52 |
| Zeros | 10126 |
| Zeros (%) | 1.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 11 |
| median | 22 |
| Q3 | 34 |
| 95-th percentile | 48 |
| Maximum | 52 |
| Range | 52 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 14.458681 |
|---|---|
| Coefficient of variation (CV) | 0.62981482 |
| Kurtosis | -1.0233089 |
| Mean | 22.957035 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.26702406 |
| Sum | 19383497 |
| Variance | 209.05345 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25 | 20119 | 2.4% |
| 11 | 20098 | 2.4% |
| 8 | 20093 | 2.4% |
| 10 | 20079 | 2.4% |
| 5 | 20066 | 2.4% |
| 4 | 20063 | 2.4% |
| 7 | 20053 | 2.4% |
| 9 | 20051 | 2.4% |
| 3 | 20044 | 2.4% |
| 2 | 20040 | 2.4% |
| Other values (43) | 643632 |
| Value | Count | Frequency (%) |
| 0 | 10126 | |
| 1 | 19448 | |
| 2 | 20040 | |
| 3 | 20044 | |
| 4 | 20063 | |
| 5 | 20066 | |
| 6 | 20039 | |
| 7 | 20053 | |
| 8 | 20093 | |
| 9 | 20051 |
| Value | Count | Frequency (%) |
| 52 | 5035 | |
| 51 | 8319 | |
| 50 | 12355 | |
| 49 | 12333 | |
| 48 | 12334 | |
| 47 | 12334 | |
| 46 | 12182 | |
| 45 | 12333 | |
| 44 | 12334 | |
| 43 | 11042 |
year_week
Text
| Distinct | 137 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.0 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2015-30 |
|---|---|
| 2nd row | 2015-30 |
| 3rd row | 2015-30 |
| 4th row | 2015-30 |
| 5th row | 2015-30 |
| Value | Count | Frequency (%) |
| 2015-20 | 6722 | 0.8% |
| 2015-16 | 6722 | 0.8% |
| 2013-42 | 6721 | 0.8% |
| 2013-38 | 6721 | 0.8% |
| 2013-41 | 6721 | 0.8% |
| 2013-40 | 6721 | 0.8% |
| 2015-15 | 6720 | 0.8% |
| 2015-23 | 6720 | 0.8% |
| 2015-26 | 6720 | 0.8% |
| 2014-14 | 6718 | 0.8% |
| Other values (127) | 777132 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1126233 | |
| 1 | 1123367 | |
| 2 | 1123094 | |
| - | 844338 | |
| 3 | 545102 | |
| 4 | 515558 | |
| 5 | 305579 | 5.2% |
| 8 | 82840 | 1.4% |
| 6 | 82795 | 1.4% |
| 9 | 80848 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5910366 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1126233 | |
| 1 | 1123367 | |
| 2 | 1123094 | |
| - | 844338 | |
| 3 | 545102 | |
| 4 | 515558 | |
| 5 | 305579 | 5.2% |
| 8 | 82840 | 1.4% |
| 6 | 82795 | 1.4% |
| 9 | 80848 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5910366 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1126233 | |
| 1 | 1123367 | |
| 2 | 1123094 | |
| - | 844338 | |
| 3 | 545102 | |
| 4 | 515558 | |
| 5 | 305579 | 5.2% |
| 8 | 82840 | 1.4% |
| 6 | 82795 | 1.4% |
| 9 | 80848 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5910366 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1126233 | |
| 1 | 1123367 | |
| 2 | 1123094 | |
| - | 844338 | |
| 3 | 545102 | |
| 4 | 515558 | |
| 5 | 305579 | 5.2% |
| 8 | 82840 | 1.4% |
| 6 | 82795 | 1.4% |
| 9 | 80848 | 1.4% |
| Distinct | 173 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.9 MiB |
| Minimum | 1900-01-01 00:00:00 |
|---|---|
| Maximum | 2015-08-01 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
competition_time_month
Real number (ℝ)
High correlation  Zeros 
| Distinct | 376 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.679672 |
| Minimum | -32 |
|---|---|
| Maximum | 1407 |
| Zeros | 268025 |
| Zeros (%) | 31.7% |
| Negative | 70101 |
| Negative (%) | 8.3% |
| Memory size | 12.9 MiB |
Quantile statistics
| Minimum | -32 |
|---|---|
| 5-th percentile | -7 |
| Q1 | 0 |
| median | 16 |
| Q3 | 74 |
| 95-th percentile | 145 |
| Maximum | 1407 |
| Range | 1439 |
| Interquartile range (IQR) | 74 |
Descriptive statistics
| Standard deviation | 66.814412 |
|---|---|
| Coefficient of variation (CV) | 1.6030455 |
| Kurtosis | 126.85589 |
| Mean | 41.679672 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 7.3388556 |
| Sum | 35191731 |
| Variance | 4464.1657 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 268025 | |
| 1 | 9476 | 1.1% |
| 7 | 5316 | 0.6% |
| 5 | 5234 | 0.6% |
| 4 | 5232 | 0.6% |
| 6 | 5214 | 0.6% |
| 9 | 5163 | 0.6% |
| 8 | 5147 | 0.6% |
| 10 | 5140 | 0.6% |
| 11 | 5038 | 0.6% |
| Other values (366) | 525353 |
| Value | Count | Frequency (%) |
| -32 | 30 | < 0.1% |
| -31 | 147 | < 0.1% |
| -30 | 323 | < 0.1% |
| -29 | 445 | 0.1% |
| -28 | 593 | |
| -27 | 772 | |
| -26 | 853 | |
| -25 | 896 | |
| -24 | 976 | |
| -23 | 1139 |
| Value | Count | Frequency (%) |
| 1407 | 5 | < 0.1% |
| 1406 | 25 | |
| 1405 | 25 | |
| 1404 | 23 | |
| 1403 | 23 | |
| 1402 | 26 | |
| 1401 | 26 | |
| 1400 | 21 | |
| 1393 | 23 | |
| 1392 | 24 |
promo_since
Date
| Distinct | 167 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 12.9 MiB |
| Minimum | 2009-07-27 00:00:00 |
|---|---|
| Maximum | 2015-07-27 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
promo_time_week
Real number (ℝ)
High correlation  Zeros 
| Distinct | 440 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.400699 |
| Minimum | -126 |
|---|---|
| Maximum | 313 |
| Zeros | 421646 |
| Zeros (%) | 49.9% |
| Negative | 57241 |
| Negative (%) | 6.8% |
| Memory size | 12.9 MiB |
Quantile statistics
| Minimum | -126 |
|---|---|
| 5-th percentile | -19 |
| Q1 | 0 |
| median | 0 |
| Q3 | 109 |
| 95-th percentile | 230 |
| Maximum | 313 |
| Range | 439 |
| Interquartile range (IQR) | 109 |
Descriptive statistics
| Standard deviation | 85.457559 |
|---|---|
| Coefficient of variation (CV) | 1.5708908 |
| Kurtosis | 0.1129961 |
| Mean | 54.400699 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.1033835 |
| Sum | 45932577 |
| Variance | 7302.9944 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 421646 | |
| 52 | 3910 | 0.5% |
| 98 | 1872 | 0.2% |
| 102 | 1847 | 0.2% |
| 97 | 1830 | 0.2% |
| 103 | 1828 | 0.2% |
| 101 | 1778 | 0.2% |
| 99 | 1777 | 0.2% |
| 94 | 1770 | 0.2% |
| 93 | 1764 | 0.2% |
| Other values (430) | 404316 |
| Value | Count | Frequency (%) |
| -126 | 12 | < 0.1% |
| -125 | 18 | < 0.1% |
| -124 | 18 | < 0.1% |
| -123 | 18 | < 0.1% |
| -122 | 18 | < 0.1% |
| -121 | 26 | |
| -120 | 30 | |
| -119 | 30 | |
| -118 | 30 | |
| -117 | 46 |
| Value | Count | Frequency (%) |
| 313 | 35 | < 0.1% |
| 312 | 42 | < 0.1% |
| 311 | 42 | < 0.1% |
| 310 | 42 | < 0.1% |
| 309 | 42 | < 0.1% |
| 308 | 42 | < 0.1% |
| 307 | 217 | |
| 306 | 252 | |
| 305 | 251 | |
| 304 | 251 |
Interactions
Correlations
| assortment | competition_distance | competition_open_since_month | competition_open_since_year | competition_time_month | day | day_of_week | is_promo | month | promo | promo2 | promo2_since_week | promo2_since_year | promo_time_week | sales | school_holiday | state_holiday | store | store_type | week_of_year | year | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| assortment | 1.000 | 0.066 | 0.061 | 0.080 | 0.070 | 0.000 | 0.149 | 0.005 | 0.006 | 0.013 | 0.016 | 0.092 | 0.111 | 0.098 | 0.094 | 0.004 | 0.068 | 0.115 | 0.538 | 0.006 | 0.007 |
| competition_distance | 0.066 | 1.000 | -0.024 | 0.008 | -0.007 | -0.000 | -0.000 | 0.071 | -0.000 | 0.003 | 0.162 | -0.014 | 0.030 | -0.031 | -0.038 | 0.003 | 0.010 | -0.045 | 0.046 | -0.000 | 0.003 |
| competition_open_since_month | 0.061 | -0.024 | 1.000 | -0.235 | 0.151 | -0.001 | -0.006 | 0.102 | 0.314 | 0.013 | 0.129 | 0.107 | 0.017 | -0.029 | -0.003 | 0.147 | 0.015 | -0.034 | 0.071 | 0.313 | 0.082 |
| competition_open_since_year | 0.080 | 0.008 | -0.235 | 1.000 | -0.930 | 0.001 | 0.001 | 0.032 | -0.035 | 0.000 | 0.054 | 0.017 | 0.031 | 0.045 | 0.040 | 0.001 | 0.002 | 0.002 | 0.055 | -0.034 | 0.005 |
| competition_time_month | 0.070 | -0.007 | 0.151 | -0.930 | 1.000 | 0.009 | -0.001 | 0.032 | 0.014 | 0.001 | 0.050 | -0.024 | 0.061 | 0.016 | -0.027 | 0.000 | 0.001 | -0.003 | 0.052 | 0.016 | 0.064 |
| day | 0.000 | -0.000 | -0.001 | 0.001 | 0.009 | 1.000 | 0.008 | 0.040 | -0.006 | 0.315 | 0.000 | 0.015 | 0.002 | 0.013 | -0.065 | 0.140 | 0.022 | -0.000 | 0.000 | 0.080 | 0.016 |
| day_of_week | 0.149 | -0.000 | -0.006 | 0.001 | -0.001 | 0.008 | 1.000 | 0.018 | -0.019 | 0.414 | 0.029 | -0.006 | 0.003 | -0.009 | -0.179 | 0.204 | 0.026 | 0.000 | 0.168 | -0.035 | 0.007 |
| is_promo | 0.005 | 0.071 | 0.102 | 0.032 | 0.032 | 0.040 | 0.018 | 1.000 | 0.233 | 0.005 | 0.429 | 0.172 | 0.301 | 0.394 | 0.054 | 0.029 | 0.007 | 0.038 | 0.045 | 0.185 | 0.033 |
| month | 0.006 | -0.000 | 0.314 | -0.035 | 0.014 | -0.006 | -0.019 | 0.233 | 1.000 | 0.041 | 0.029 | 0.473 | -0.069 | 0.020 | 0.062 | 0.411 | 0.035 | 0.001 | 0.007 | 0.996 | 0.263 |
| promo | 0.013 | 0.003 | 0.013 | 0.000 | 0.001 | 0.315 | 0.414 | 0.005 | 0.041 | 1.000 | 0.000 | 0.059 | 0.016 | 0.013 | 0.371 | 0.029 | 0.011 | 0.000 | 0.018 | 0.115 | 0.024 |
| promo2 | 0.016 | 0.162 | 0.129 | 0.054 | 0.050 | 0.000 | 0.029 | 0.429 | 0.029 | 0.000 | 1.000 | 0.280 | 0.683 | 0.909 | 0.119 | 0.008 | 0.010 | 0.072 | 0.108 | 0.028 | 0.031 |
| promo2_since_week | 0.092 | -0.014 | 0.107 | 0.017 | -0.024 | 0.015 | -0.006 | 0.172 | 0.473 | 0.059 | 0.280 | 1.000 | -0.121 | -0.041 | 0.080 | 0.208 | 0.026 | 0.005 | 0.074 | 0.474 | 0.133 |
| promo2_since_year | 0.111 | 0.030 | 0.017 | 0.031 | 0.061 | 0.002 | 0.003 | 0.301 | -0.069 | 0.016 | 0.683 | -0.121 | 1.000 | -0.793 | 0.086 | 0.028 | 0.008 | 0.008 | 0.086 | -0.066 | 0.614 |
| promo_time_week | 0.098 | -0.031 | -0.029 | 0.045 | 0.016 | 0.013 | -0.009 | 0.394 | 0.020 | 0.013 | 0.909 | -0.041 | -0.793 | 1.000 | -0.067 | 0.029 | 0.007 | -0.010 | 0.083 | 0.023 | 0.252 |
| sales | 0.094 | -0.038 | -0.003 | 0.040 | -0.027 | -0.065 | -0.179 | 0.054 | 0.062 | 0.371 | 0.119 | 0.080 | 0.086 | -0.067 | 1.000 | 0.038 | 0.055 | 0.001 | 0.112 | 0.060 | 0.034 |
| school_holiday | 0.004 | 0.003 | 0.147 | 0.001 | 0.000 | 0.140 | 0.204 | 0.029 | 0.411 | 0.029 | 0.008 | 0.208 | 0.028 | 0.029 | 0.038 | 1.000 | 0.032 | 0.000 | 0.005 | 0.382 | 0.045 |
| state_holiday | 0.068 | 0.010 | 0.015 | 0.002 | 0.001 | 0.022 | 0.026 | 0.007 | 0.035 | 0.011 | 0.010 | 0.026 | 0.008 | 0.007 | 0.055 | 0.032 | 1.000 | 0.007 | 0.071 | 0.033 | 0.004 |
| store | 0.115 | -0.045 | -0.034 | 0.002 | -0.003 | -0.000 | 0.000 | 0.038 | 0.001 | 0.000 | 0.072 | 0.005 | 0.008 | -0.010 | 0.001 | 0.000 | 0.007 | 1.000 | 0.098 | 0.001 | 0.005 |
| store_type | 0.538 | 0.046 | 0.071 | 0.055 | 0.052 | 0.000 | 0.168 | 0.045 | 0.007 | 0.018 | 0.108 | 0.074 | 0.086 | 0.083 | 0.112 | 0.005 | 0.071 | 0.098 | 1.000 | 0.007 | 0.010 |
| week_of_year | 0.006 | -0.000 | 0.313 | -0.034 | 0.016 | 0.080 | -0.035 | 0.185 | 0.996 | 0.115 | 0.028 | 0.474 | -0.066 | 0.023 | 0.060 | 0.382 | 0.033 | 0.001 | 0.007 | 1.000 | 0.250 |
| year | 0.007 | 0.003 | 0.082 | 0.005 | 0.064 | 0.016 | 0.007 | 0.033 | 0.263 | 0.024 | 0.031 | 0.133 | 0.614 | 0.252 | 0.034 | 0.045 | 0.004 | 0.005 | 0.010 | 0.250 | 1.000 |
Missing values
Sample
| store | day_of_week | date | sales | promo | state_holiday | school_holiday | store_type | assortment | competition_distance | competition_open_since_month | competition_open_since_year | promo2 | promo2_since_week | promo2_since_year | is_promo | year | month | day | week_of_year | year_week | competition_since | competition_time_month | promo_since | promo_time_week | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 5 | 2015-07-31 | 5263 | 1 | regular_day | 1 | c | basic | 1270.0 | 9 | 2008 | 0 | 31 | 2015 | 0 | 2015 | 7 | 31 | 30 | 2015-30 | 2008-09-01 | 84 | 2015-07-27 | 0 |
| 1 | 2 | 5 | 2015-07-31 | 6064 | 1 | regular_day | 1 | a | basic | 570.0 | 11 | 2007 | 1 | 13 | 2010 | 1 | 2015 | 7 | 31 | 30 | 2015-30 | 2007-11-01 | 94 | 2010-03-22 | 279 |
| 2 | 3 | 5 | 2015-07-31 | 8314 | 1 | regular_day | 1 | a | basic | 14130.0 | 12 | 2006 | 1 | 14 | 2011 | 1 | 2015 | 7 | 31 | 30 | 2015-30 | 2006-12-01 | 105 | 2011-03-28 | 226 |
| 3 | 4 | 5 | 2015-07-31 | 13995 | 1 | regular_day | 1 | c | extended | 620.0 | 9 | 2009 | 0 | 31 | 2015 | 0 | 2015 | 7 | 31 | 30 | 2015-30 | 2009-09-01 | 71 | 2015-07-27 | 0 |
| 4 | 5 | 5 | 2015-07-31 | 4822 | 1 | regular_day | 1 | a | basic | 29910.0 | 4 | 2015 | 0 | 31 | 2015 | 0 | 2015 | 7 | 31 | 30 | 2015-30 | 2015-04-01 | 4 | 2015-07-27 | 0 |
| 5 | 6 | 5 | 2015-07-31 | 5651 | 1 | regular_day | 1 | a | basic | 310.0 | 12 | 2013 | 0 | 31 | 2015 | 0 | 2015 | 7 | 31 | 30 | 2015-30 | 2013-12-01 | 20 | 2015-07-27 | 0 |
| 6 | 7 | 5 | 2015-07-31 | 15344 | 1 | regular_day | 1 | a | extended | 24000.0 | 4 | 2013 | 0 | 31 | 2015 | 0 | 2015 | 7 | 31 | 30 | 2015-30 | 2013-04-01 | 28 | 2015-07-27 | 0 |
| 7 | 8 | 5 | 2015-07-31 | 8492 | 1 | regular_day | 1 | a | basic | 7520.0 | 10 | 2014 | 0 | 31 | 2015 | 0 | 2015 | 7 | 31 | 30 | 2015-30 | 2014-10-01 | 10 | 2015-07-27 | 0 |
| 8 | 9 | 5 | 2015-07-31 | 8565 | 1 | regular_day | 1 | a | extended | 2030.0 | 8 | 2000 | 0 | 31 | 2015 | 0 | 2015 | 7 | 31 | 30 | 2015-30 | 2000-08-01 | 182 | 2015-07-27 | 0 |
| 9 | 10 | 5 | 2015-07-31 | 7185 | 1 | regular_day | 1 | a | basic | 3160.0 | 9 | 2009 | 0 | 31 | 2015 | 0 | 2015 | 7 | 31 | 30 | 2015-30 | 2009-09-01 | 71 | 2015-07-27 | 0 |
| store | day_of_week | date | sales | promo | state_holiday | school_holiday | store_type | assortment | competition_distance | competition_open_since_month | competition_open_since_year | promo2 | promo2_since_week | promo2_since_year | is_promo | year | month | day | week_of_year | year_week | competition_since | competition_time_month | promo_since | promo_time_week | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1016588 | 494 | 2 | 2013-01-01 | 3113 | 0 | public_holiday | 1 | b | basic | 1260.0 | 6 | 2011 | 0 | 1 | 2013 | 0 | 2013 | 1 | 1 | 0 | 2013-00 | 2011-06-01 | 19 | 2012-12-31 | 0 |
| 1016606 | 512 | 2 | 2013-01-01 | 2646 | 0 | public_holiday | 1 | b | extra | 590.0 | 1 | 2013 | 1 | 5 | 2013 | 0 | 2013 | 1 | 1 | 0 | 2013-00 | 2013-01-01 | 0 | 2013-01-28 | -4 |
| 1016624 | 530 | 2 | 2013-01-01 | 2907 | 0 | public_holiday | 1 | a | extended | 18160.0 | 1 | 2013 | 0 | 1 | 2013 | 0 | 2013 | 1 | 1 | 0 | 2013-00 | 2013-01-01 | 0 | 2012-12-31 | 0 |
| 1016656 | 562 | 2 | 2013-01-01 | 8498 | 0 | public_holiday | 1 | b | extended | 1210.0 | 1 | 2013 | 0 | 1 | 2013 | 0 | 2013 | 1 | 1 | 0 | 2013-00 | 2013-01-01 | 0 | 2012-12-31 | 0 |
| 1016770 | 676 | 2 | 2013-01-01 | 3821 | 0 | public_holiday | 1 | b | extra | 1410.0 | 9 | 2008 | 0 | 1 | 2013 | 0 | 2013 | 1 | 1 | 0 | 2013-00 | 2008-09-01 | 52 | 2012-12-31 | 0 |
| 1016776 | 682 | 2 | 2013-01-01 | 3375 | 0 | public_holiday | 1 | b | basic | 150.0 | 9 | 2006 | 0 | 1 | 2013 | 0 | 2013 | 1 | 1 | 0 | 2013-00 | 2006-09-01 | 77 | 2012-12-31 | 0 |
| 1016827 | 733 | 2 | 2013-01-01 | 10765 | 0 | public_holiday | 1 | b | extra | 860.0 | 10 | 1999 | 0 | 1 | 2013 | 0 | 2013 | 1 | 1 | 0 | 2013-00 | 1999-10-01 | 161 | 2012-12-31 | 0 |
| 1016863 | 769 | 2 | 2013-01-01 | 5035 | 0 | public_holiday | 1 | b | extra | 840.0 | 1 | 2013 | 1 | 48 | 2012 | 1 | 2013 | 1 | 1 | 0 | 2013-00 | 2013-01-01 | 0 | 2012-11-19 | 6 |
| 1017042 | 948 | 2 | 2013-01-01 | 4491 | 0 | public_holiday | 1 | b | extra | 1430.0 | 1 | 2013 | 0 | 1 | 2013 | 0 | 2013 | 1 | 1 | 0 | 2013-00 | 2013-01-01 | 0 | 2012-12-31 | 0 |
| 1017190 | 1097 | 2 | 2013-01-01 | 5961 | 0 | public_holiday | 1 | b | extra | 720.0 | 3 | 2002 | 0 | 1 | 2013 | 0 | 2013 | 1 | 1 | 0 | 2013-00 | 2002-03-01 | 131 | 2012-12-31 | 0 |